WOW: the Hungarian Deep Web Searcher

Authors

  • Domonkos Tikk
  • Zsolt T. Kardkovács
  • Gábor Magyar
Abstract

This paper summarizes the goals and presents the results of our ongoing research and development project, “In the Web of Words” (WOW), funded by the National R+D Program in Hungary. The project aims to create a complex search interface that incorporates, besides the usual keyword-based search functionality, deep web search, Hungarian natural language (NL) question processing, and image search supported by a visual thesaurus. In this paper we focus on the system architecture and NL processing. One of the most crucial parts of the system is the transformation of NL questions into adequate SQL queries that conform to the schema and attribute conventions of the contracted partner databases. This transformation is performed in three steps: NL question processing, context recognition, and SQL transformation.
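
The three steps named in the abstract can be illustrated with a minimal sketch. It is not the WOW implementation: the English regex patterns stand in for Hungarian NL question processing, and the partner schemas, table names, attribute names, and function names (`process_question`, `recognize_context`, `to_sql`, `answer_query`) are all hypothetical, chosen only to show how the stages might hand data to one another.

```python
import re
from dataclasses import dataclass


@dataclass
class ParsedQuestion:
    target: str        # attribute the question asks for
    constraints: dict  # attribute name -> value given in the question


# Hypothetical partner schemas; the abstract does not describe the real ones.
PARTNER_SCHEMAS = {
    "library": {"table": "books", "attributes": {"author", "title", "year"}},
    "cinema": {"table": "films", "attributes": {"director", "title", "year"}},
}

# Toy patterns standing in for real Hungarian NL question processing.
PATTERNS = [
    (re.compile(r"^Who wrote (?P<title>.+)\?$"), "author"),
    (re.compile(r"^Who directed (?P<title>.+)\?$"), "director"),
]


def process_question(question):
    """Step 1: turn the question into a target attribute plus constraints."""
    for pattern, target in PATTERNS:
        match = pattern.match(question)
        if match:
            return ParsedQuestion(target, match.groupdict())
    raise ValueError(f"unsupported question: {question!r}")


def recognize_context(parsed):
    """Step 2: pick the first partner schema that covers every attribute used."""
    needed = {parsed.target, *parsed.constraints}
    for name, schema in PARTNER_SCHEMAS.items():
        if needed <= schema["attributes"]:
            return name
    raise LookupError("no partner database covers these attributes")


def to_sql(parsed, context):
    """Step 3: build a parameterized SQL query for the chosen schema."""
    table = PARTNER_SCHEMAS[context]["table"]
    columns = sorted(parsed.constraints)
    where = " AND ".join(f"{column} = ?" for column in columns)
    params = [parsed.constraints[column] for column in columns]
    return f"SELECT {parsed.target} FROM {table} WHERE {where}", params


def answer_query(question):
    """Run the three steps in order and return (sql, params)."""
    parsed = process_question(question)
    context = recognize_context(parsed)
    return to_sql(parsed, context)
```

In this sketch, values from the question are passed as query parameters rather than spliced into the SQL string, so a question such as "Who wrote Hamlet?" yields a query against the hypothetical `books` table with `Hamlet` as its single parameter.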

Similar resources

Entity Recognizer in Hungarian Question Processing

In our ongoing research and development project, called “In the Web of Words” (WoW), funded by the National R+D Program in Hungary, we aim to create a complex search interface that incorporates, besides the usual keyword-based search functionality, (1) deep web search, (2) Hungarian natural language question processing, and (3) image search supported by a visual thesaurus. This paper focuses on a particu...

On the Transformation of Sentences with Genitive Relations to SQL Queries

In our ongoing project called “In the Web of Words” (WoW) we aimed to create a complex search interface that incorporates a deep web search engine module based on a Hungarian question processor. One of the most crucial parts of the system was the transformation of genitive relations to adequate SQL queries, since, for example, questions beginning with “Who” and “What” mostly contain such a relation. The geni...

Cross-Lingual Image Search on the Web

Most people locate images on the Web by querying image search engines such as Google’s. The images are tagged by the words in their “vicinity”, which limits the ability of a searcher to retrieve them. Although images are universal, an English searcher will fail to find images tagged in Chinese, and a Spanish searcher will fail to find images tagged in English. Cross-lingual homonyms cause probl...

Determining Relevant Deep Web Sites by Query Context Identification

Deep web search requires a transformation between search keywords and semantically described and well-formed data structures. We approached this problem in our “In the Web of Words” (WoW) project by allowing natural language sentence queries and by a context identification method that connects the queries and deep web sites via database information. In this paper we propose a novel SQL based ap...

Anomaly-based Web Attack Detection: The Application of Deep Neural Network Seq2Seq With Attention Mechanism

Today, the use of the Internet and Internet sites has become an integral part of people’s lives, and most activities and important data reside on websites. Thus, attempts to intrude into these websites have grown exponentially. Intrusion detection systems (IDS) for web attacks are one approach to protecting users. However, these systems suffer from such drawbacks as low accuracy in ...

Publication date: 2004